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Abstract: We study the design of mechanisms in combinatorial auction domains. We focus on settings 
where the auction is repeated, motivated by auctions for licenses or advertising space. We consider 
models of agent behaviour in which they either apply common learning techniques to minimize the 
regret of their bidding strategies, or apply short-sighted best-response strategies. We ask: when can a 
black-box approximation algorithm for the base auction problem be converted into a mechanism that 
approximately preserves the original algorithm's approximation factor on average over many iterations? 
We present a general reduction for a broad class of algorithms when agents minimize external regret. 
We also present a new mechanism for the combinatorial auction problem that attains an 0{y/Tri) ap- 
proximation on average when agents apply best-response dynamics. 
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1 Introduction 

We consider problems in the combinatorial auc- 
tion (CA) domain, where m objects are to be al- 
located among n potential buyers in order to max- 
imize total value, subject to problem-specific fea- 
sibility constraints. These packing problems are 
complicated by game-theoretic issues: the buyers 
might benefit from misrepresenting their values to 
an allocation algorithm. This prompts us to design 
mechanisms that use payments to encourage rea- 
sonable behaviour The well-known VCG mech- 
anism solves incentive issues by inducing truth- 
telling as a dominant strategy, but is infeasible for 
computationally intractible problems. Indeed, for 
many interesting problems (such as combinatorial 
auctions), there are large gaps between the best- 
known approximation factors attainable by effi- 
cient truthful mechanisms and those possible in 
purely computational settings. For some problems, 
these large gaps are essential [25]. 

In this paper we consider the problem of design- 
ing mechanisms that implement approximations to 
combinatorial auction problems without the use 
of dominant-strategy truthfulness. We are moti- 
vated by the domain of repeated auctions, where 
an auction problem is resolved multiple times with 



the same objects and bidders. These include, for 
example, auctions for advertising spaces or slots 
[13], bandwidth auctions (such as the FCC spec- 
trum auction), airline landing rights auctions [10], 
etc. In these settings a mechanism for the (one- 
shot) auction problem corresponds to a repeated 
game to be played by the agents.' 

The question of how to model agent behaviour 
in repeated games has been studied extensively in 
the economic and algorithmic game theory liter- 
ature (see chapters 17-21 of [24] and references 
therein). Many proposed models suppose that 
agents choose strategies (or distributions there- 
over) at equilibrium, where no agent has incentive 
to unilaterally deviate. There are, however, a num- 
ber of reasons to believe such models are unrealis- 
tic: in general equilibria are computationally hard 
to find, and may not exist without the presence of 
agents who randomize over strategies for no rea- 
son other than to preserve the stability of the sys- 
tem. Even when pure equilibria exist, agents may 
not necessarily converge to an equilibrium (of the 
single-round game) or agree on which equilibrium 

' A simple extension allows preferences and participants to 
change over time, but sufficiently slowly compared to the rate 
of auction repetition. Our results should extend easily to such 
settings. 
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(of the extended-form game) to choose. Such con- 
cerns have also been noted elsewhere [4, 14]. 

We will instead focus our attention on two mod- 
els of agent behaviour that do not make equilib- 
rium assumptions, and have gained recent interest 
in the algorithmic game theory literature. In the 
first model, agents can play arbitrary sequences 
of strategies for the repeated auction, under the 
assumption that they obtain low regret relative to 
the best fixed strategy in hindsight. More pre- 
cisely, the average external regret of such a bidder 
must tend to as the number of auction rounds in- 
creases. These regret-minimizing bidders can be 
seen as agents that learn how to bid intelhgently 
(relative to any fixed strategy benchmark) from the 
bidding history of past auction iterations. Note that 
we require no assumptions about the synchrony or 
asynchrony of updates; arbitrary sets of agents can 
update their strategies concurrently. The regret- 
minimization assumption is reahstic because sim- 
ple, efficient algorithms exist that minimize exter- 
nal regret for linear optimization problems such 
as repeated auctions [19, 20]. Under this model, 
our goal is to design an auction mechanism that 
achieves an approximation to the optimal social 
welfare on average over sufficiently many rounds 
of the repeated auction. This is precisely the prob- 
lem of designing a mechanism with bounded price 
of total anarchy, as introduced by Blum et al [4]. 

In the second model, we assume that agents 
choose myopic best-response strategies to the cur- 
rent strategies of the other agents. Such bid- 
ding behaviour is best motivated in settings where 
agents update their declarations asynchronously 
(ie. not concurrently). We therefore model this 
behaviour as follows: on each auction round, an 
agent is chosen uniformly at random, and that 
agent is given the opportunity to change his strat- 
egy to the current myopic best-response. Under 
this model, our goal again is to design auction 
mechanisms that achieve approximations to the 
best possible social welfare on average over suffi- 
ciently many auction rounds, with high probability 
over the random choices of bidders. This is closely 
related to the concept of the price of (myopic) sink- 
ing, as introduced by Goemans et al [14]. 

Our high-level goal is to decouple computa- 
tional issues from incentives issues. A full (and ad- 
mittedly ambitious) solution in our domain would 



be a black-box conversion of a given approxima- 
tion algorithm into a mechanism that implements^ 
the same approximation ratio, on average over suf- 
ficiently many auction rounds, given our model of 
bidder behaviour. Our primary research question, 
partially addressed herein, is to what extent such 
implementations are possible. 

1.1 Our Contribution 

We design mechanisms that are based on a par- 
ticular class of approximation algorithms for com- 
binatorial auction problems: those that are mono- 
tone and satisfy the loser-independence property. 
An algorithm is monotone if, whenever a bidder 
can win some set 5* by declaring a value of v for 
it, then he could also win any subset of S with 
any declared value at least v. This monotonic- 
ity condition characterizes truthfulness when bid- 
ders are single-minded (meaning that each agent 
has value for only a single set), but not for gen- 
eral auction problems [22]. Roughly speaking, an 
algorithm is loser-independent if the outcome for 
an agent depends only on those agents who would 
win if he did not participate, and on their declared 
values for their winnings. This extends a notion 
of loser-independence for single-parameter prob- 
lems, introduced by Chekuri and Gamzu [8], to 
general auction problems. Many interesting algo- 
rithms satisfy these properties, including greedy 
algorithms for CAs [22] and convex bundle auc- 
tions [1], primal-dual algorithms for unspUttable 
flow [7], and others. 

Our first main result is that any monotone loser- 
independent c-approximate algorithm can be im- 
plemented as a mechanism with price of total an- 
archy at most c(l + o(l)). Our mechanism is a 
black-box reduction from an algorithm for a one- 
shot auction iteration, and the same mechanism is 
applied each auction round. The form of our mech- 
anism is very simple: on each round, it applies a 
simple modification to the bidders' declarations, 
then runs the approximation algorithm on the mod- 
ified declarations and charges critical prices (i.e. 
an agent who wins a set pays the smallest amount 
he could have declared for that set and won it, 
given the declarations of the other bidders). 

^Throughout the paper we use "implement" in the economic 
sense of obtaining the desired properties when used by rational 

agents. 
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Our implementation does not depend on the spe- 
cific algorithms used by the agents to minimize 
their regret; only that their regret vanishes as the 
number of rounds increases. The rate of conver- 
gence to our approximation bound will depend on 
the rate at which the agents' regret vanishes. 

We demonstrate that our mechanism is resilient 
to the presence of byzantine agents, in the fol- 
lowing sense. If each agent either applies regret- 
minimizing strategies or makes arbitrary declara- 
tions (but never declares more than his true value 
for a set), then the mechanism attains a c(l -|- 
0(1)) approximation to the optimal welfare obtain- 
able by the regret-minimizing bidders. The no- 
overbidding assumption is necessary (as otherwise 
a byzantine agent could bid arbitrarily highly and 
prevent any welfare from being obtained) and re- 
alistic, since we can view byzantine players as not 
understanding how to participate intelligently in 
the auction and thus likely to bid conservatively. 

We conjecture that the mechanism described 
above also implements an 0(c) approximation, 
on average over sufficiently many rounds, in the 
model of best-response bidders. Whether this is so 
is an important open question. 

We then focus specifically on the general com- 
binatorial auction problem in the best-response 
model. Specifically, we present a mechanism that 
implements an 0{s) approximation for combina- 
torial auctions with set allocations of size at most 
s, then extend this to a mechanism that implements 
an 0(\/m) approximation for general combinato- 
rial auctions. Note that this approximation fac- 
tor is the best possible, in that we attain it with 
high probability after polynomially many auction 
rounds (in fact, only a slightly superlinear num- 
ber of rounds). We point out that while truth- 
ful mechanisms with similar approximation ratios 
are known for single-minded combinatorial auc- 
tions, our results are significant improvements over 
what is known to be achieveable with deterministic 
truthful algorithms. 

Our results require a mild game-theoretic as- 
sumption, which is that bidders will not apply 
strategies that are (strictly) dominated by easily- 
found alternatives. This is precisely the assump- 
tion of algorithmically undominated strategies, as 
introduced by Babaioff et al [2]. Additionally, 
the mechanism for best-response bidders also ap- 



plies a technique known in implementation the- 
ory as virtual implementation, where an alterna- 
tive social choice rule is applied with vanishingly 
small probability [18]. We view this not as an in- 
troduction of randomness into the algorithm be- 
ing implemented, but rather as the introduction 
of a trembling-hand consideration into the solu- 
tion concept that encourages reasonable behaviour 
when best-response agents must distinguish be- 
tween otherwise equally beneficial strategies. 

1.2 Regret Minimization 

We now describe the concept of external regret 
minimization in further detail. The external regret 
of a sequence of declarations is the difference be- 
tween the average utility of an agent (i.e. value 
of goods received minus payment extracted) and 
the maximum average utility that could have been 
obtained by a single fixed declaration made each 
round. An algorithm for generating declarations is 
regret-minimizing if its regret vanishes as a func- 
tion of the number of auction rounds. 

A simple and efficient algorithm due to Kalai 
and Vempala [20] minimizes regret for hnear op- 
timization problems, even when the strategy space 
has exponential size. The algorithm requires ac- 
cess to an exact best-response oracle. Kakade et 
al [19] show how to use a 7-approximate best re- 
sponse oracle to achieve a 7-approximation to the 
best fixed declaration in hindsight. 

The regret-minimization mechanism we con- 
struct in this paper will reduce the strategic choices 
of bidders to a simple linear optimization prob- 
lem, so that the algorithms described above can 
be used by agents to minimize external regret. 
This requires access to best-response oracles for 
the approximation algorithms being implemented. 
The ability to compute best-response choices is 
also a necessary component for the setting of best- 
response bidders, where the best-response assump- 
tion certainly requires that such strategies can be 
found efficiently. It is easy to compute best- 
responses when agents choose between only poly- 
nomially many strategies (such as, for example, 
when each agent's true valuation is a combina- 
tion of polynomially many desired bundles). In 
general, however, the problem of computing best- 
response is non-trivial when agents have exponen- 
tially many strategic choices; we leave the con- 
struction of such oracles to the creators of particu- 
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lar auction mechanisms. 

1.3 Related Work 

Truthful mechanisms for the combinatorial auc- 
tion problem have been extensively studied. For 
general CAs, Hastad's well-known inapproxima- 
bility result [16] shows that it is hard to approx- 
imate the problem to within ^l{^/rn) assuming 
NP ^ ZPP. The best known deterministic 
truthful mechanism for CAs with general valua- 
tions attains an approximation ratio of 0(-^j==) 

[17]. A randomized 0(v^)-approximate mech- 
anism that is truthful in expectation was given 
by Lavi and Swamy [21]. Dobzinski, Nisan and 
Schapira [12] then gave an O(y^) -approximate 
universally truthful randomized mechanism. 

Many variations on the combinatorial auction 
problem have been considered in the literature. 
Bartal et. al. [3] give a truthful O(m^) 
mechanism for multi-unit combinatorial auctions 
with B copies of each object, for all B > 3. 
Dobzinski and Nisan [11] construct a truthful 2- 
approximate mechanism for multi-unit auctions. 
Many other problems have truthful mechansisms 
([7, 22, 23]) when bidders are restricted to being 
single-minded. In [5] the authors study the limited 
power of certain classes of greedy algorithms for 
truthfully approximating CA problems. 

The problem of designing combinatorial auc- 
tion mechanisms that implement approximations 
at equihbria (and, in particular, Bayes-Nash equi- 
libria for partial information settings) was consid- 
ered in [9] for submodular CAs, and in [6] for gen- 
eral CA problems. Implementation at equiUbrium, 
especially for the alternative goal of profit maxi- 
mization, has a rich history in the economics hter- 
ature; see, for example, Jackson [18] for a survey. 

The study of regret-minimization goes back to 
the work of Hannan on repeated two-player games 
[15]. Kalai and Vempala [20] extend the work 
of Hannan to online optimization problems, and 
Kakade et al [19] further extend to settings of ap- 
proximate regret minimization. Blum et al [4] ap- 
ply regret-minimization to the study of inefficiency 
in repeated games, coining the phrase "price of to- 
tal anarchy" for the worst-case ratio between the 
optimal objective value and the average objective 
value when agents minimize regret. 

Properties of best-response dynamics in re- 



peated games, and especially the question of con- 
vergence to a pure equilibrium, is well-studied (see 
Chapter 19 of [24]). The study of average perfor- 
mance of best-response dynamics as a metric of 
game inefficiency, the so-called "price of sinking," 
was introduced by Goemanns et al [14]. 

Babaioff et al [2] study implementation of al- 
gorithms in undominated strategies, which is a re- 
laxation of the dominant strategy truthfulness con- 
cept. They focus on a variant of the CA problem in 
which agents are assumed to have "single-value" 
valuations, and present a mechanism to implement 
such auctions in a multi-round fashion. By com- 
parison, mechanisms in our proposed model solve 
each instance of an auction in a one-shot manner, 
and our solution concept assumes that the auction 
is repeated multiple times. 

2 ]VIodel and Definitions 

In general we will use boldface to represent vec- 
tors, subscript i to denote the ith component, and 
subscript —i to denote all components except i, so 
that, for example, v = (w^, v_i). 

We consider the domain of combinatorial auc- 
tion problems, where n agents desires subsets of 
a set M of m objects. An allocation profile is 
a collection of subsets Xi, . . . , X„, where Xi is 
thought of as the subset allocated to agent i. A 
particular problem instance is defined by the set of 
feasible allocation profiles that are permitted; for 
example, the general combinatorial auction prob- 
lem requires that all allocated subsets be disjoint. 
Each agent i has a privately-held valuation func- 
tion ti : 2^ R, his type, that assigns a value 
to each allocation. We assume that valuation func- 
tions are monotone and normahzed so that v(fi) = 
0. A valuation function v is single-minded if there 
exists 5 C M and X > such that v{T) = x if 
S C T and v{T) = otherwise. We will write 
for the zero valuation, and {S, x) for a single- 
minded declaration for S at value x. 

An allocation rule A assigns to each valuation 
profile V a feasible outcome A{v); we write Ai (v) 
for the allocation to agent i. We write A for both an 
allocation rule and an algorithm that implements it. 

An allocation rule is loser-independent if, when- 
ever v'_j satisfy ^(0, v_,) = ^(0, v'_ J and 
Vj{Aj{lI},^^-^)) = vj(^(0,v'_,)) for all i ^ i, 
then A{vi,-v-i) = A{vi,V_^). In other words, 
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agent i's perception of the behaviour of A depends 
only on those agents who would win if agent i did 
not participate, and on their declared values for 
their winnings. 

A payment rule P assigns a vector of n pay- 
ments to each valuation profile. A direct revela- 
tion mechanism M is composed of an allocation 
rule A and a payment rule P. The mechanism pro- 
ceeds by eUciting a valuation profile d of decla- 
rations from the agents, then applying the alloca- 
tion and payment rules to d. The utihty of agent 
i for mechanism M., given declaration profile d, 
is Uj(d) = U{Ai{d)) - Pi{d). We think of each 
agent as wanting to choose di to maximize Ui{d). 

The social welfare obtained by allocation pro- 
file X, given type profile t, is S'W^(X,t) = 
^jti(Xj). Given fixed type profile t, we write 
SWopt for maxx{S'VF(X,t)}, and SWA{d) = 
T.^U{A{d)). When D = (d^, d^, . . . , d^) 
is a sequence of valuation profiles, we write 
SWa{D) = ^ Et SWA{d') for the average wel- 
fare obtained over all declarations in D. We will 
sometimes replace subscript ^ by A^, in which 
case the social welfare is for the allocation rule of 
M. Note that algorithm ^ is a c-approximation if 
SWA{t) > i^V^optforallt. 

Given allocation rule A, agent i, declara- 
tion profile d_i, and set S, the critical price 
0f{S,d-i) for 5 is the minimum value that agent 
i could bid on set S and be allocated S by A 
given fixed d_i. That is, d_i) = 'm{{v : 
3di,d,{S) = v,A,{d,,d^,) = S}. 

We say that a declaration di is weakly dominated 
by declaration d/ for agent i if Ui{di,d-i) < 
Ui{di,d^i) for all d i, and Ui{di,d-i) < 
Ui{di , d-i) for some d_j. 

Declaration sequence D = (d°, d^ , . . . , d^) 
minimizes external regret for agent i if, for 
any fixed declaration di, ^tUi{di^ ,d*_^) > 
J2tUi{di,dti)-\-o{T). That is, the utiUty of agent 
/ approaches the utility of the optimal fixed strat- 
egy in hindsight. 

Declaration sequence D = (d°, d^ , . . . , d^) 
is an instance of response dynamics if, for all 
1 < t < T, profiles d*~^ and d* differ on the 
declaration of at most one player. Response dy- 
namics D is an instance of best-response dynam- 
ics if, whenever d*~^ and d* differ on the dec- 
laration of agent i, d\ maximizes agent i's utility 



Mechanism M.X- 

Input: Declaration profile d = di, . . . , rf„. 

1. d' <— SIMPLIFY(d). 

2. Allocate A{d'), charge critical prices. 

Procedure SIMPLIFY : 

Input: Declaration profile d = di, . . . , rf„. 

1. For each i G [n]: 

2. Choose S'j e argmax(j{(ii (5)}, breaking 
ties in favour of smaller sets. 

3. di' ^{Si,di{Si)). 

4. Return {d'l, . . . 



Figure 1: Mechanism for regret-minimizing bidders, 
based on monotone algorithm A. Uses subprocedure 

SIMPLIFY. 



given the declarations of the other bidders. That 

is, dl e argmax^{Mj(d, dij}. 

3 Regret-Minimizing Bidders 

In this section we prove that if agents avoid al- 
gorithmic ally dominated strategies and minimize 
external regret, then a loser-independent monotone 
algorithm A can be converted into a mechanism 
with almost no loss to its average approximation 
ratio over many rounds. The mechanism. Ma, is 
described in Figure 1. Mechanism Ma proceeds 
by first simplifying the declaration given by each 
agent, then passing the simplified declarations to 
algorithm A. The resulting allocation is paired 
with a payment scheme that charges critical prices. 

The simplification process SIMPLIFY essen- 
tially converts any declaration into a single-minded 
declaration (and does not affect declarations that 
are already single-minded). We will therefore as- 
sume without loss of generality that agents make 
single-minded declarations, as additional informa- 
tion is not used by the mechanism.^ 

Fix a particular combinatorial auction problem 
and type profile t, and let A be some monotone 
approximation algorithm. Let d be a declaration 

'We note, however, that this is not the same as assuming 
that agents are single-minded; our results hold for bidders with 
general private valuations. 
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profile; we suppose each di is a single-minded bid 
for set Si. We draw the following conclusion about 
the bidding choices of rational agents. 

Lemma 3.1. Declaration di is an undominated 
strategy for agent i if and only ifdi{Si) = ti{Si). 

Proof. For all d_i, A^^((ii,d_i) either allocates 
Si or to agent i. Thus agent i's utility for declar- 
ing di, Ui{di, d-i), is ti{Si)-6f^-^ {Si, d_i) when 
> 0f^'^{S^,d_,), and otherwise. A dec- 
laration of di{Si) = ti{S) therefore maximizes 
Ui{di,d-i) for all d-i. 

On the other hand, if di{Si) ^ ti{Si), let d/ 
be the single-minded declaration for Si at value 
ti{Si). Then for any d-i such that 9i'^{Si, d-i) 
lies between di{Si) and ti{Si), ?ii((i/,d_i) > 
Ui{di,d-i). For simplicity we will assume such 
a d-i exists; handUng the general case requires 
only a technical and uninteresting extension of no- 
tation"*. Thus declaration d/ weakly dominates 
declaration di. □ 

One implication of Lemma 3.1 is that the strate- 
gic choice of an agent participating in mechanism 
Ma reduces to a hnear optimization problem. On 
each round, we can think of agent i as choosing 
set Si, which is the set he will attempt to win that 
round. Once Si is chosen, an undominated decla- 
ration for agent i is determined: the single-minded 
declaration for Si at value ti{Si). Given that agent 
i chooses set Si, his utility will be ti{Si) — Wi, 
where Wi = mm{ti{Si), Of {Si, d-i)} is the price 
for set Si, determined by the declarations of the 
other agents, capped at ti{Si). Thus, since utilities 
are linear in the choice of Si, agents can indeed 
apply the regret-minimization algorithm of Kalai 
and Vempala [20] to choose strategies that mini- 
mize external regret. 

We now proceed with bounding the social wel- 
fare obtained by Ma- Let Ai,. . . ,An be an op- 
timal assignment for types t. Suppose that D = 
d'^ , . . . , d^ is a sequence of declarations to our 
mechanism. The definition of regret minimization 
then unmediately impUes the following. 

''if 6i'^{Si,d-i) never lies between di{Si) and ti{Si)) 
for any d-i, then MA{di,d-i) = MA{di' ,d-i) for all 
d-i, so di and di' are equivalent strategies. We can therefore 
think of di as being "the same" as a single-minded declaration 
for Si at value ti(Sj). We will ignore this technical issue for 
the remainder of the paper, in the interest of keeping the expo- 
sition simple. 



Lemma 3.2. If agent i minimizes his external re- 
gret in bid sequence D, then y ^ti^i{A{d*)) + 
ei^{Ai,dU))>ti{Ai)-o{l). 

Assume now that algorithm A is loser indepen- 
dent. We can then relate the value of the solution 
returned by an algorithm to the critical prices of 
the sets in an optimal solution. 

Lemma 3.3. If A is a monotone loser-independent 
c-approximate algorithm, then ^idi{A{d)) > 
lEiOf{Ai,d-i). 

Proof. Choose some e > 0. For each i, 
let di be the pointwise maximum between di 
and the single-minded declaration for set Ai 
at value 9f{Ai,d^i) — e. The loser inde- 
pendence property implies that we can per- 
form this operation independently for each agent 
(i.e. without affecting critical prices), and 
moreover A{d') = A{d). Since A is 
a c-approximate algorithm, di{A{d')) > 
}X;d,'{A{) > \ Y.^{Of{A,d^,) - e). Addi- 
tionally, since di{T) = di{T) whenever di{T) > 
ef{T,d-i) (from the definition of d-), we have 
di'{A{d')) = d,{A{d)) for all i. We conclude 
thatEirfi(^(d)) > \ T.i{Ot{Ai,d-i) - e) for 
all e > 0, as required. □ 

We are now ready to proceed with the proof of 
our main result in this section. 

Theorem 3.4. Any monotone loser-independent c- 

approximate algorithm can he implemented as a 
mechanism with c + 1 price of total anarchy. 

Proof. Let D = d^ , . . . , d^ be a sequence of 
declarations in which all agents minimize exter- 
nal regret. By Lemma 3.2, y M(d*)) + 
ef{Ai,d^_i)) > ti{Ai) - o(l). Summing over all 
i, we have ^ Et Ei(tiM(d*)) + ^i^(Ai, d*_,)) > 
SWoPT — (")(o(l)). By Lemma 3.3, this implies 
TEtEi(ii(^(d*)) + cd*(^(d*))) > SWoPT- 
(n)(o(l)). We know dl{A{d*)) = ti{A{d*)) for 
all i and t by Lemma 3.1, so we conclude (c + 
l)TEtEii.(^(d*)) > SWoPT - {n){o{l)). 
Since the term hidden by the asymptotic notation 
vanishes with T and does not depend on n, we ob- 
tain the desired result. □ 

Theorem 3.4 is very general, as it appUes to 
a number of different problem settings in which 



6 



loser-independent monotone approximation algo- 
rithms are known. As a few particular examples, 
Theorem 3.4 yields an 0{y/rn) implementation of 
the combinatorial auction problem [22], an s + 1 
implementation of the combinatorial auction prob- 
lem where sets are restricted to cardinality s (using 
a simple greedy algorithm), an 0{ra^/'^^^'^^ ) im- 
plementation of the unsplittable flow problem with 
minimum edge capacity B [7], and an 0(i?*/^) 
implementation of the combinatorial auction of 
convex bundles in the plane where R is the maxi- 
mum aspect ratio over all desired bundles [1]. 

We note that, since agents experience no regret 
at a pure Nash equilibrium, an irmnediate corol- 
lary to Theorem 3.4 is that any monotone loser- 
independent c-approximate algorithm can be im- 
plemented as a mechanism with c + 1 price of 
anarchy. We remark that an alternative proof of 
this result has been given recently using a different 
mechanism construction [6]. 

Also, the rate at which the welfare obtained by 
Ma converges to an average that is a c + 1 ap- 
proximation to optimal depends on the rate of con- 
vergence of players' external regret to 0. The av- 
erage welfare obtained after T rounds will have an 
additive loss of (n)(r(T)), where r(T) is the aver- 
age regret experienced by an agent after T rounds. 
Assuming that agents apply algorithms that mini- 
mize regret at a rate of r(T) = o{1/^/T), which is 
easily attainable using the algorithm of Kalai and 
Vempala [20], the additive error term is at most a 
constant when T is at least quadratic in n. 

3.1 Resilience to Byzantine Agents 

Suppose that in addition to regret-minimizing 
agents, the auction participants include byzantine 
agents. The only restriction we impose on the be- 
haviour of such agents is that they do not overbid 
on any set; that is di{S) < ti{S) for any S and 
byzantine agent i. We can motivate this restric- 
tion either through our characterization of undom- 
inated strategies in Lemma 3.1, or by thinking of 
byzantine players as not understanding how to par- 
ticipate rationally in the auction, and hence likely 
to be conservative in the way that they participate. 
Under this assumption, since Lemma 3.3 holds for 
any declaration profile, we easily obtain the fol- 
lowing generalization to Theorem 3.4. 



Proposition 3.5. Suppose A is a monotone loser- 
independent c-approximate algorithm and D is a 
declaration sequence for M.j\.. IfN C [n] is a col- 
lection of agents that minimize regret in D, and the 
remaining agents never bid more than their true 
values on any set in D, then y SWj({d*) > 
^Eieiv^W^opt + IA^IKl)). 

3.2 Importance of Loser-Independence 

We note that the loser independence property is 
necessary for Theorem 3.4, as the following exam- 
ple demonstrates. 

Example 3.6. Consider an auction problem in 
which no agent can be allocated more than s ob- 
jects, and moreoever M = AVJ B where \A\ = 
\B\ = m,/2 and the mechanism must either allo- 
cate objects in A or objects in B, but not both. 
Consider the algorithm that takes the maximum 
over two solutions: a greedy assignment of sub- 
sets of A, and a greedy assignment of subsets of 
B. This algorithm obtains an s -|- 1 approximation. 

Consider now an instance of the problem in 
which a single agent desires all of B with value 
1, and each of m/2 agents desires a separate sin- 
gleton in A with value 1 — e. Suppose that the 
agent desiring B declares his valuation truthfully, 
but the other agents declare the zero valuation. On 
this input, the algorithm under consideration ob- 
tains only an m/2 approximation to the optimal 
solution. However, this set of declarations forms a 
Nash equilibrium, and hence each agent has zero 
regret under this input profile. Thus, even if agents 
minimize their regret, our mechanism may obtain a 
very poor approximation to the optimal social wel- 
fare over arbitrarily many auction rounds. 

4 Best-Response Agents 

In this section we consider the problem of de- 
signing mechanisms for agents that apply myopic 
best-response strategies asynchronously. Recall 
that in our model agents are chosen for update uni- 
formly at random, one per round. In order to keep 
our exposition clear, we will make two additional 
assumptions about the nature of the best-response 
behaviour (which can be removed, as we discuss 
below). First, we will suppose that in the ini- 
tial state every bidder makes the empty declaration 
0. Second, we suppose that if a bidder is chosen 
for update but caimot improve his utility, he will 
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choose to maintain his previous strategy. These 
assumptions will simplify the process of charac- 
terizing best-response strategies of agents, and in 
particular the statement of Lemma 4.3 in the next 
section. It is possible to remove these assumptions, 
at the cost of a minor modification to the mecha- 
nisms we propose. We defer a more complete dis- 
cussion to the appendix. 

A simple example shows that mechanism Ma 
may not converge to a Nash equilibrium via best- 
response dynamics; this example is presented in 
the appendix. We conjecture that, on average, the 
best-response dynamics on mechanism A4a ob- 
tains a good approximation to the optimal social 
welfare. 

Conjecture 4.1, If A is a monotone loser- 
independent c-approximate algorithm, then M.a 
has 0{c) price of (myopic) sinking. 

As partial progress toward resolving Conjecture 
4.1, we construct alternative mechanisms that are 
more amenable to best-response analysis. These 
mechanisms are tailored specifically to the general 
combinatorial auction problem, and combinatorial 
auctions with cardinality-restricted sets. Our hope 
is to demonstrate the intuition behind Conjecture 
4. 1 and explore mechanism design tools that may 
prove useful in its resolution. 

The primary tool we will use is the following 
probabilistic lemma, which pertains to any mech- 
anism in a best-response setting. Suppose is a 
mechanism, and D is a sequence of best-response 
declarations for Ai. For any d, let Pi{d) = 
Pi(d_i) be some property of d that does not de- 
pend on di, and let P2 (d) = P2{di) be some prop- 
erty depending only on di. 

Lemma 4.2. Suppose that, for any d, if PiiA-i) 
is false, then any best response by agent i, di, 
satisfies P2{di). Then for all e > 0, if best- 
response dynamics is run for T > e~^n steps, 
there will be at least — e)T steps t for which 
either Pi(d*_J or P2(c?i*) is true, with probability 
at least 

Proof (sketch). Let Bj be the event that neither 
Pi(d'L,i) nor P2(dj*) is true, and let A* denote the 
event that P2 (d/) is true. Our goal is to bound the 
number of occurrances of B*. 



Note that if P* occurs and agent i is chosen for 
update on step t + 1, then occurs (by assump- 
tion). Alternatively, if Aj occurs but agent i is not 
chosen for update on step t+1 then A*"*"^ occurs, 
since Ai depends only on the declaration of agent 
i. Thus events A* and Bj can be compared to a ran- 
dom walk on {0, 1}, where at each step the current 
state changes with probability 1/n. The number 
of occurrances of Bf is dominated by the num- 
ber of occurrances of in such a random walk. 
A straightforward application of the method of 
bounded average differences shows that this value 
is concentrated around its expectation, which is at 
most ^ + f • Thus, as long as T > e~^n, the 
number of occurrances of will be concentrated 
at r(i + e), giving the desired bound. Additional 
details are deferred to the appendix. □ 

4.1 A Mechanism for s-CAs 

Consider the s-CA problem, which is a combi- 
natorial auction in which no agent can be allocated 
more than ,s objects. An algorithm that greedily as- 
signs sets in descending order by value obtains an 
(s -I- 1) approximation.^ Call this algorithm AsCA ■ 
We will construct a mechanism AdsCA based on 
AsCA', it is described in Figure 2. This algorithm 
simplifies incoming bids (in the same way as Ma) 
and runs algorithm AsCA to find a potential allo- 
cation. However, an additional condition for in- 
clusion in the solution is imposed: the value de- 
clared for a set must be larger than the sum of all 
bids for intersecting sets. Potential allocations that 
satisfy this condition are allocated, and the mech- 
anism charges critical prices (that is, the smallest 
value at which an agent would be allocated their 
set by MsCA, which is not necessarily the same 
as the critical price for AsCa)- 

We note that since our mechanism implements 
a monotone algorithm and charges critical prices. 
Lemma 3.1 implies that undominated strategies for 
agent i involve choosing a set Si and making a 
single-minded bid for Si at value ti{Si). We will 
therefore assume that agents bid in this way. 

Suppose that d is a declaration profile, where 
each di is single-minded for Si. For any set T, de- 
fine P,(d, T) = {j : j y^i^SjHT 0}. We also 
define Qi(d,T) = {j : j G Ri{d,T),dj{Sj) < 
ti{T)}. That is, Ri is the set of bidders other than 

'And an s approximation for single-minded declarations. 
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Mechanism MsCA- 



Input: Declaration profile d = di, . . . , d„. 

1. d' <- SIMPLIFY(d), say dj' = {Si,Vi) 

2. (ri,...,r„) ^ AcA(d'). 

3. For each i such that Ti 

4. R^{j:Sjr\Ti^%}. 

6. If di'{Ti) < Pi, set Ti ^%,pi^ 0. 

7. Allocate Ti, . . . , T„, charge critical prices. 



Figure 2: Mechanism MaCA, an implementation of 
greedy algorithm AsCA for the s-CA problem. 



i whose single-minded declared sets intersect T, 
and Qi is the subset of those bidders whose single- 
minded declared values are less than agent i's true 
value for T. We then say that d is separated for 

agent i if J2j(^Qi(d,Si) ^ji^j) ^ and d is 

separated if it is separated for every bidder. Since 
an agent gains positive utihty only if the declara- 
tion is separated for him, and since the intial state 
is the empty declaration profile (which is sepa- 
rated), we draw the following conclusion. 

Lemma 4.3. At each step of the best-response 
dynamics for mechanism AdsCA, the declaration 
profile submitted by the agents will be separated. 

For the remainder of the section we will assume 
that declaration profiles are separated. Under this 
assumption, the behaviour of mechanism M.sCA 
simplifies in a fortuitous way. 

Proposition 4.4. If d is separated, then MsCA 
allocates Si to agent i precisely when di{Si) > 
maxjeiJi(Si,d) dj{Sj). 

Proof If di{Si) > ma.Xj^ii.(^Si,d)dj{Sj), 
then Si is allocated by AsCA{<i)- Fur- 
thermore Qi{Si,d) = Ri{Si,d), so 
di{Si) > T,j&R,(s„d)dj{Sj) implies 
di{Si) > Y.jeni(Si,d)d]{Sj) and hence Si 
will be allocated by MsCA- On the other hand, 
if di{Si) < maxjg^.(5._d) (ij(5'j), then cer- 
tainly d^{S^) < T,jeRt{St,d)dj{Sj) so Si is not 
allocated by MsC a- □ 

Let Ai,. . . ,An be an optimal allocation with 
respect to the agents' true types t. 



Proposition 4.5. If d is separated and 
J2jeR,{d,A,)dj{Sj) < ^ti{A,), then any 
utility-maximizing declaration for agent i, dt, will 
be a single-minded declaration for some Si with 

d^iSi) > lU{Ai). 

For declaration profile d, let G denote the set of 
agents i for which either X^jgfl.(d,Ai) dj{Sj) > 
^ti{At) ov di{Si) > ^ti{Ai). We now bound the 
social welfare obtained by MsCA with respect to 
the optimal assigrmient to agents in G. 

Lemma 4.6. 

We are now ready to bound the average social 
welfare of our mechanism, over sufficienfly many 
rounds, with respect to the approximation factor of 
algorithm A. 

Theorem 4.7. Choose e > and suppose D = 
d^, . . . ,dF is an instance of best-response dynam- 
ics with random player order, where agents play 
undominated strategies, andT > e~^n. Then 



SW^ 



MsC. 



e]SWopt{t) 



.8(s+l) 

with probability at least 1 — ne~^^^/^^". 

Proof Let Gj be the set of agents G from Lemma 
4.6 on step t (i.e. with respect to declaration 
d*). Lemma 4.2 and Proposition 4.5 together im- 
ply that each agent i will be in Gt for at least 
(i — e)T values of t, with probability at least 
1 — e^^*^ /32n rpj^g union bound then implies that 
this occurs for every agent with probability at least 
1 — ne~^' /32n Conditioning on the occurrance 
of this event, Lemma 4.6 implies 

SWmscAD) = ^^SWMscAd') 



> 



> 



> 



1 


4(s 


+ 1)T 




1 


4(s 




f 


1 



EE 



.8(s+l) 
which implies the required bound. 



e I ti{Ai 
e ] SWoptit) 



□ 
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Mechanism Mca'- 

Input: Declaration profile d = di, . . . ,dn. 

1. d' ^ SlMPLlFY(d), say = {Si,Vi) 

2. With probability 7: 

3. For alH with Si = M, rf/ ^ 0. 

4. Let in,...,Tn)^M^cA{d'). 

5. If 3i :Si = M: 

6. hetj <— axgm.ax.j{d'j{M) : Sj = M}. 

7. lfdr{S,)>Ek^r.s,=Md'k{Sk)^d 

d'j{Sj)>j:idfi{Tiy- 

8. Set Tj ^ M, Tj <- for all i ^ j 

9. Allocate Ti , . . . , T„, charge critical prices. 



Figure 3: Mechanism A^ca, a best-response implemen- 
tation of a greedy algorithm for the CA problem. Param- 
eter 7 > is an arbitrarily small positive constant. Note 
M^CA is MsCA from Figure 2 with s = ^fm. 



Taking, say, e = 0.1, and assuming T » n, 
we conclude that SWm^ca i^) > o}l)^^opt{t) 
with high probability. Thus AisCA implements an 
0{s) approximation to the s-CA problem for best- 
response bidders. 

4.2 A Mechanism for General CAs 

Consider the following algorithm for the general 
CA problem: try greedily assigning sets, of size at 
most y/m, by value; return either the resulting so- 
lution or the allocation that gives all items to a sin- 
gle agent, whichever has higher welfare. This al- 
gorithm is an 0{y/m) approximation [23]. We will 
construct a mechanism ^AcA based on this algo- 
rithm; it is described in Figure 3. A4ca essentially 
implements two copies of A4sCA- one for sets of 
size at most (which we will call Al^p^), 
and one for allocating all objects to a single bidder; 
it then takes the maximum of the two solutions. We 
add one additional modification: with vanishingly 
small probability 7, AicA ignores bids for M and 
behaves as A^^ca- Th^ purpose of this modifi- 
cation is to encourage agents to bid on small sets, 
even when the presence of a high-valued bid for 
a large set would seem to indicate that bidding on 
small sets is fruitless. 

The analysis of the average social welfare ob- 
tained by Mca closely follows the analysis for 
MsCA- Our high-level approach is to apply this 



analysis twice: once for allocations of sets of size 
at most ^/m, and once for allocations of all objects 
to a single bidder. The primary compUcating fac- 
tor is that the bidding choice of an agent may be 
influenced by the mechanism's choice of whether 
or not to allocate M to a single bidder; this can be 
handled by a careful analysis of utility-maximizing 
declarations. We defer all details to the appendix. 
The final result is the following. 

Theorem 4.8. Choose e > and suppose D = 

d^ , . . . ,d?" is an instance of best-response dynam- 
ics with random player order, where agents play 
undominated strategies, andT > e~^n. Then 

SWmca{D) > {of^ - SWoptit) 

with probability at least 1 - 2ne~'^^^/^^". 

We conclude that mechanism Mca implements 
an 0{^/rn) approximation to the combinatorial 
auction problem for best-response bidders, with 
high probability, whenever T » n. 

5 Conclusions and Future Work 

We considered the problem of designing mech- 
anisms for use with regret-minimizing and best- 
response bidders in repeated combinatorial auc- 
tions. We presented a general black-box construc- 
tion for the regret-minimization model, which im- 
plements any monotone loser-independent approx- 
imation algorithm. For the best-response model, 
we constructed an O(y^) -approximate mecha- 
nism for the combinatorial auction problem. 

The most obvious direction for future research 
is to extend our results to implement additional 
algorithms. Our best-response mechanisms made 
specific use of the structure of greedy CA algo- 
rithms, but it seems likely that our approach can 
be generaUzed. Another specific question of note 
is whether Conjecture 4.1 is true, and the mech- 
anism we proposed for regret-minimizing bidders 
also yields good performance when used by best- 
response bidders. Our techniques appear Umited 
to loser-independent algorithms; can algorithms 
that are not loser-independent be implemented in 
regret-minimization or best-response domains? A 
broader research topic is to explore other models 
for reasonable bidder behaviour, which may admit 
different mechanism implementations. 
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Appendix A: Proof of Proposition 3.5 

Recall the statement to be proven. We suppose 
^ is a monotone loser-independent c-approximate 
algorithm and D is a declaration sequence for 
A4a- If iV C [n] is a collection of agents that 
minimize regret in D, and the remaining agents 
never bid more than their true values on any set 
in D, then we must show that ^ J2t SW^{d*) > 
-^EieNtMi) + l^l(o(l)) for any allocation 
profile Ai,. . . , An- 

We proceed with the proof. By Lemma 3.2 and 
summing over all i G N, 

Y,U{A,)-\N\ioil)) 

<^^^iUiA{d')) + Oi{A„dU)) 
t ieN 

t ie[n] 

By Lemma 3.3, this implies 

^E.ZMAd')) + cdl{A{d'))) > 

E,eArt»(^i) - |-^^|(o(l)). The result then 
follows from the fact that c?f(>l(d*)) < tl{A{d*)) 
for all i. 

Appendix B: Assumptions on the Best- 
Response Model 

Recall that in our model of best-response dy- 
namics, we assumed that in the initial state every 
bidder makes the empty declaration 0, and that if 
a bidder is chosen for update but cannot improve 
his utiUty, he will choose to maintain his previous 
strategy. We used these assumptions to argue that 
agents make only separated declarations when par- 
ticipating in mechanisms Msca and Mca- 

These assumptions can be removed, as follows. 
We can modify mechanisms A4sCA and A4ca so 
that, with vanishingly small probabiUty, an alter- 
native allocation rule is used. This alternative rule 
chooses an agent at random, and assigns him all 
objects at no cost as long as the input declaration 
is separated for that agent. Thus, any separated 
declaration by agent i results in positive expected 
utiUty. Then, since any non-separated declaration 
by an agent results in a utility of for that agent, it 
must be that the utility-maximizing declaration by 
any agent must be a separated declaration. 



It follows that after each bidder is chosen at least 
once for update, and every step thereafter, the in- 
put declaration will be separated. Thus, with high 
probability, every declaration after O(nlogn) 
steps will be separated (by the coupon-collector 
problem). Lemma 4.3 will therefore hold after 
O(nlogn) steps of best-response dynamics, with 
high probability; the remainder of the analysis can 
then proceed without change. 

Appendix C: Best-response Dynamics 
on Ma might not Converge 

Consider a combinatorial auction problem 
where each agent can receive at most 2 items. Let 
A be the greedy allocation rule that allocates sets 
greedily by value. Suppose there are 6 agents and 
4 objects, say {a, 6, c, d], and that the agents' true 
valuations are given by the following set of bids 
(where the value for a set not listed is taken to be 
the maximum over its subsets). 



player 


set 


value 


1 


{a, 6} 


4 


1 


{d} 


6 


2 


{a} 


2 


2 


{6,c} 


5 


3 


{c} 


4 


4 


{d} 


5 



Since agents 3 and 4 are single-minded, they 
always maximize their utility by declaring values 
truthfully, so we can assume in any sequence of 
best-response moves that they do so. By contrast, 
players 1 and 2 each have a strategic choice to 
make each round: which of their two desired sets 
should they bid upon? Note that once this decision 
is made, the way to bid is determined by Lennma 
3.1 (i.e. bid truthfully for the desired set). We will 
now show that from each of the resulting 4 possi- 
ble declaration profiles, some player has incentive 
to change their declaration. 

Suppose that player 1 bids for set {d} and player 
2 bids for {6, c}. (Note that under the assumption 
that all declarations start empty, we can reach such 
a state by selecting agents for update in the or- 
der 3,4,1,2,1). Then player 1 has no incentive to 
change his declaration, but player 2 would benefit 
by changing his bid to {a} (increasing his utility 
from 1 to 2). From that state, player 1 then has in- 
centive to remove his bid for {d} and instead win 
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set {a, b}, also increasing his utility from 1 to 2. If 
he does so, player 2's utility becomes 0, so player 
2 benefits by switching back to his bid for {6, c}. 
This results in player 1 winning nothing, so player 
1 gains by adding back his bid for {d}. We have 
returned to the original state, and hence no reach- 
able state forms an equilibrium. 

Appendix D: Proof of Lemma 4.2 

Recall the statement to be proven. We suppose 

that, for any d, if Pi(d_i) is true, then the best 
response by agent i, di, satisfies P2{di). We then 
wish to show that, for all e > 0, if best-response 
dynamics is run for T > e^^n steps, there will be 
at least (i — e)T steps t for which either Pi (d^ J 
is false or P2{di*) is true, with probability at least 

Proof. For each t, let B| be the event that neither 

Pi{d*_i) nor P2{d.*) is true. Let A* denote the 
event that P2{d/) is true. Note that Al and B* are 
mutually exclusive. Consider the steps in which 
either A* or _B* occurs: let Cr denote the event 
that Bj occurs on the rth step in which either 
or Bj occurs. We can think of the index r as repre- 
senting time steps, where we skip any time step in 
which neither A* nor Bj occurs. We wish to show 

PriErKT^r > + e)T] < e-^^V32n^ ^J^ich 

implies our desired result. 

Suppose that event Cr occurs; this implies that 
Bj occurs, where t is the rth step on which either 
Al or B* occurs. With probability -i- agent i is 
chosen for update on step t + 1. Conditioning on 
that event, Af^^ occurs (by the assumption of the 
Lemma), and hence C^+i does not occur. We con- 
clude Pr[a+i|Cr] < (1 - 1/n). 

Next suppose that event Cr does not occur; this 
implies that Aj occurs, where t is the rth step on 
which either Al or Bj occurs. With probability 
(1 — ^) agent i is not chosen for update on step 
t + 1. Conditioning on that event, A^^^ occurs 
(since P2 depends only on the declaration of agent 
i, which does not change), and hence C^+i does 
not occur We conclude Pr[CV+i l-iCr.] < 1/n. 

Let -Di, £>2, ■ • ■ , P't be a random walk on 
{0,1} defined by Pr[Dr\Dr^i] = (1 - 1/n), 
Pr[Dr\^Dr-i] = 1/n, and initial condition 
Do ■ Then Cr is stochastically dominated by 
J2r Dr, and hence Pr[E, > (1 + e)T] < 



■P^ErA- > + e)T]. It will therefore suf- 
fice to show that Pr[J2r<T^r > (5 + e)P] < 

g-Te^/32n 

The definition of Dr yields 

Pr[Dr] = ^(1 - Pr[Dr]) + (1 - ^)Pr[D,] 

= - + (l--)Pr[D.]. 
n n 

Solving the recurrence (with initial condition Dq) 

yields 

Pr[Dr] = i(l - (1 - 2/ny) + Do{l - 2/ny 



Linearity of expectation then implies 
ElY^Dr] = lT+{Do-b^{l-{l-2/nf-'). 



2 ^ " 2'2 

r 

From this we conclude that 



E[J2Dr]<lT + 



(1) 



and moreover 

E 



J2Dr\Do = l 



E 



(2) 



< 



Let k = T/n and define random variables 
Fi,...,Fk by P, = Y.re[^n.(^+l)n-l] ^r- Then 
Fi G [0,n] for all i, and J2rDr = Er^r- 
Furthermore, the influence of Fi on Pi+i, . . . ,Fk 
is captured entirely by the value of 
and from (2) the influence of £)(j_|_i)„_i on 

Er=(«+i)„ = Er=i+i Pr is bouuded by f . 
Since the value of Fi also influences the sum 
Er Pr directly by at most n (due to its being in- 
cluded in the sunomation), we conclude that for all 

a, a' G [0, n]. 



E 



E 



J2Fj\Fi,---,Fi-i,Fi=a 

Fj\Fi,. . . , Pi_i , Fi = 



< 3n/2 
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Thus, by the method of bounded average differ- 
ences, we conclude that 



Pr 



<e 



3 13 
-(Te/2f/2(3n/2fk 



+ (e/2)T 



< e 



-Te^/32n 



Since T > e-^n, we have that E[J2j Fj] + 
(e/2)r < It + f + (6/2)T < (i + f )r. Thus 



Proof (of Proposition 4.5). Note that 

agent i would obtain utility at least ^ti{Ai) by 
making a single-minded declaration for set Ai 
at value ti(Aj). Thus his utility-maximizing 
declaration must make at least this much utility, 
and therefore is a bid for some set Si with 
ti{Si) > ^ti{Ai). The result then follows from 
Lemma 3.1. □ 



Pr 



result follows. 



Ej Fj > {l + l)T < e"'^" and the Proof (of Lemma 4.6). J^e begin by showing that 



□ 



Appendix E: Omitted proofs from Sec- 
tion 4.1 

Proof (of Lemma 3.2). Let d' be the single- 
minded declaration for set Ai at value ti{Ai). 
From the definition of regret minimization, 

1 J2u,{d,\ dU) >Lj2^,{d', dU) - 0(1) 

t t 

>^EO^(^^)-^^''(^-d*_,))-o(l) 
t 

= ii(^i)-^E^^^(^-d*-)-«W- 
t 

Sinceui(di*,diJ < ii(>l(d*)) for all t, the result 
follows. □ 

Proof (of Lemma 4.3). We will prove the follow- 
ing claim, which immediately implies the desired 
result due to our assumption that the initial state 
of best-response dynamics is the empty declaration 
profile (which is separated): 

Claim E.l. Ifd is separated, then it remains sep- 
arated after a step of the best-response dynamics. 

Suppose agent i is chosen to update his bid, say 
from di to d/. Let d' = ((i/,d_i). If agent 
i cannot improve his utiUty then d' = d, so d' 
is separated as required. Otherwise, he changes 
the set upon which he bids from, say. Si to S[. 
Since Ui{di , d_i) > 0, it must be that di{Si) > 
EjeKi(d Si) ^3 i^j)' which implies d' is separated 
for agent i and di{Si) > dj{Sj) for all j such that 
Si n Sj ^ 0. Hence, for all j ^ i, we have 
Qj{d',Sj) C Qj{d,Sj). Thus, since d is sepa- 
rated for all j ^ i, d' must be separated for each 
j ^ ias well. □ 



SWM.cAi'i) > hE^MS^)■ Recall that if 
set Si is allocated by AdsCA, then di{Si) > 
J2jeRi{d Si) '^ji^j)- ^ — N he the set of 
agents that receive non-empty sets in A4sCA{d). 
Proposition 4.4 implies that for all j E [n] there is 
some i E N such that Si intersects 5*^ . Then 

Y,di{Si) = E d^{Si) + MSi) 

i iGN i^N 



<SWM,cA{d) + Yl E d^(^^) 

ieN jeRi{d,Si) 

<SWM,cA{d) + J2diiSi) 



2SWM.,cA{d) 



and therefore SWm.ca (d) > ^ Ei MSi)- 

Now let Gi denote the set of agents for which 
di{Si) > ■^ti{Ai), and let G2 denote the set of 

agentsfor which X;jgfl.(d,Ai)C^j('S'j) > 5*i(^i). 
so that G = Gi UG2. 

We note that 5WAi.cA(d) > ^EidiiS^) > 

EieGs Ejefi^(d,Si) '^i('^i) - 2 E^eCa *»(^»)- 
Since Sj can intersect at most \Sj\ < s sets Ai , we 

conclude .sX;jdi('S'j) > i E^eCa *»(^»)- ^his 
implies (4s)51^^,^^(d) > Eiec. tiiM 

We conclude (4s -|- 4)SWMsCA{d) > 
E^GGi di{St) + EieG2 M^i)' which implies the 
desired result. □ 

Appendix F: Proof of Theorem 4.8 

Recall the definition of a separated declaration 
profile from our analysis of MsCA- Lermna 3.1 
and Lemma 4.3 apply to Mca for the same rea- 
sons as AdsCA, so we will assume that all decla- 
rations in an instance of best-response dynamics 
are separated, and are single-minded declarations. 
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Also, we can assume that all single-minded decla- 
rations are for sets that are either M or of size at 
most ^/m, since these are the only sets allocated 
by McA- 

Proposition F.l. If d is separated, then for all j 
such that Sj ^ 0, there exists some Si such that 
di{Si) > dj (Sj), M.CA allocates Si to agent i. 

Proof. If McA allocates M to some bidder than 
the result is trivial, so suppose not. If Sj = M, 
then the result again follows trivially. If 15*^1 < 
^/m, then, from the perspective of agent j, Mca 
behaves precisely as M^ca> so the result fol- 
lows from Proposition 4.4. □ 

Fix some separated declaration profile d. We 
now wish to characterize the utility-maximizing 
declarations of an agent i. We begin with the sim- 
ple case of declarations for set M. 

Lemma F.2. d_i) < jti{M), then 

the utility -maximizing bid for agent i, di, sets 
di{Si) > ^ti{M) for some Si. 

Proof We claim that 6'f^'^(M,d_i) < ^ti{M). 
To see this, observe that since SWmca (^j d_ j) < 
jti (M) and d is separated, it must be that no agent 
(other than i) bid for M at a value greater than 
jti{M), and that the sum of all bids for M (by 
agents other than i) must be at most twice jti (M). 
The threshold amount for allocating M to a single 
bidder can also be at most SWmca{^j d-i)- So 
6'^^^-*(Af,d_,) < 2 {jU{M)) as claimed. 

This implies that agent i can make a utihty of 
^ti{M) by bidding on set M. Thus his utiUty- 
maximizing declaration must be a bid for a set Si 
with ti{Si) > ^ti{M). The result then follows 
from Lemma 3 . L □ 

Next consider allocations of small sets. Let 
Ai, . . . , An be the optimal allocation of sets of size 
at most ^/m. Recall the definition of Ri from our 
analysis of MsCA in the previous section. 

Lemma F.3. // E,efl,(d,A,) '^^('^j) < 

then the utility-maximizing bid for agent i, di, sets 

di{Si) > \ti{Ai) for some Si. 

Proof ( of Lemma F.3). Recall the definitions of Ri 
and Qi from our analysis of AdsCA in the previ- 
ous section. For ease of notation we will write 



SW^^ for SWm^ca ^^^s proof. We begin 
with a technical lemma that bounds the effect of 
a single agent's bid on the social welfare obtained 
by McA- 

Claim F.4. Suppose d is separated and 
M VrnCA(d) allocates Si to agent i. 
Then di{Si) - Y.j(^R,(d,s.)dj{Sj) < 
SW^{di,d.i) - SW^{$,d.i) < di{Si). 

Proof Let Ti , . . . , T„ be the sets allocated 

in M^ca{^A-i), and let Ui,...,Un 
be the sets allocated in M^cA{di,(i-i)- 
Thus SWM^cAi'^i^^-i) ^ Hjdj{Uj) and 

SWM^CAi^A^^)=T.Jd,{T,). 

Since Si is allocated to agent i, it must be that 
di{Si) > ^j(zR.(^d,Si)dj{Sj). Let us first con- 
sider the case that di{Si) = J2jeRi(d Si) dj{Sj). 
Note that, for any j, if Tj ^ and Uj = 0, then 
it must be that there exists some agent k such that 
j e Qkid, Sk) and Uk ^ (by Proposition F.l). 
Thus, summing over all j, it must be that 

k 

Since Ui = 5i, if we were to increase the 

value of di{Si), this increases J^k dk{Uk) by the 
same amount. Thus, if we write x = di(Si) ~ 
EjeK.(d,s.)'^j('^^)' becomes J2kMUk) - 
^ > J2j dj{Tj). Rearranging gives J2k dkiUk) - 
Ej dj{Tj) >x = diiSi) - EjeR.(d,s.) djiSj), 
which gives one half of our desired inequahty. 

For the other inequality, suppose for contradic- 
tion that SW^{di,d-i) - 5VK^(0,d_i) > 
di{Si). Define d' by setting d'^ = dj when 
S■jr^S^ = 0, and d'^ = when S'^nS'i ^ 0. That is, 
d' is d with all bids intersecting Si removed. As 
we showed in (3), replacing any winning bid with 
can only decrease the social welfare obtained by 
^^frnCA- This implies (by removing bids one at a 
time) that SW^{%, d_,) > SW^{%, d'_,). 

Note that allocation M^rf^cAidiTd^i) is pre- 
cisely {Si} U M^ica{^^ We conclude that 
5t^^K,d_,) - 51^^(0, d_,) = d,{Si) - 
(5W^(0,d_,) - 51^^(0, d'_,)) < di{Si), 
as required. This completes the proof of Claim 
F.4. □ 
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We now proceed with the proof of Lemma F.3. 
Suppose for contradiction that di is a bid for Si 
with dj(S'i) < ti{Ai)/2. Then it must be that Si 7^ 
M, since ti{M) > ti{Ai). Let d/ be the single- 
minded bid for Ai at value ti{Ai). 

We now consider cases depending on whether 
not McA{di,d^i) and/or McA{di ,d-i) allo- 
cate M to a single bidder. If McA{di,d^i) 
and McA{di ,d-i) either both allocate M to 
one agent, or neither do, then Ui(d/,d_i) > 
Ui{di, d_j), a contradiction (where the case when 
they both allocate M relies on the e-possibility that 
large bids are ignored). 

Suppose McA{di ,d-i) allocates M 
to one agent, but M.cA{di,d-i) does 
not. Since neither Ai nor Si is M, it 
must be that SWM^cAidi\^-i) < 
SWM^cAidi,d-i). However, by Claim F.4, 

SWMZcAid/,d-i) - SWM^oAi^^d-^) > 
d/{A,) - EjeR.idA,)d3{S3) > U{A,)/2, 
and SWM^{d^A-i) - SW^^(0,d_,) < 
di{Si) < ti{Ai)/2. ThusSWM^cA{di,d-i) > 
SWm^ca [diid^i), a contradiction. 

Suppose instead that M.cA{di,d-i) allocates 
M to one agent, but M.cA{di ,d-i) does not. 
Then it must be that agent i obtains utility at least 
ti{Ai)/2 when bidding for di , and (since Si ^ 
M) he obtains utility at most eti{Si) < eti{Si)/2 
when bidding for rf,;. This contradicts the assumed 
maximality of di. We have thus reached a contra- 
diction in all cases. □ 

We are now ready to complete the proof of The- 
orem 4.8, in a manner similar to Theorem 4.7. 

Proof (of Theorem 4.8). Since each d* is sepa- 
rated, it is clear that SWmca{'^^) > Ai^i) f^"" 
all t. Applying Lemma 4.2 to Lemma F.2 there- 
fore implies that at for least — e)T steps of D, 
SWmca{'^*) > \ti{M). We conclude that, with 
probability at least 1 — ne~'^'^ the average 
welfare obtained by M.ca is within a constant fac- 
tor of maxj ti{M). It must therefore also be an 
0{^/rn) approximation to the optimal welfare at- 
tainable through any allocation of sets of size at 
least y/m (as there can be at most ^/m such sets in 
any valid allocation). 

Now let Ai, . . . ,An be any allocation of sets 
of size at most ^/m, and for each t let Gt be the 



set of agents for which either di*{Sl) > ti{Ai)/2 
T.jeRM',si)dj{S*) > ^U{A,). Applying 
Lemma 4.2 to Lemma F.2, we see that for each 
agent i appears in Gt for at least — e)T time 
steps, with probability at least 1 — ne"^*^^/^^". 
The same argument as in the proof of Lemma 
4.6 then demonstrates that A4ca obtains an aver- 
age 0{y/m) approximation to J2i ti{Ai), over the 
steps of the best-response dynamics, with proba- 
bihty at least 1 - ne"'^^^/^^". 

Taking the union bound over the events de- 
scribed above, we conclude that Mca obtains an 
0{y/rn) approximation to any allocation of sets 
with size at most y/m, and an 0{y/m) approxima- 
tion to any allocation of sets of size at least y/m, 
with probability at least 1 — 2ner^'^ Since any 
allocation is a combination of sets of size at most 
y/m and sets of size at least ^/rn, we conclude that 

SWmca{D) > (^^^^ ^ e) SWoptit) with 
probability at least 1— 2ne~^^^/", as required. □ 
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